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Summary: 

The Affymetrix probeset 231381_at. exhibited an ES/EB ratio of 385. This probeset was 
derived from the Human UniGene cluster Hs.67624. This cluster is composed of 24 
individual EST sequences most of which are derived from ES celt lines and germ cell 
tumors. It is also represented by the Genbank sequence BF223023. This mRNA 
appears to be encoded within the intron of another gene, the voltage-gated calcium 
channel alpha(2)delta-3 subunit gene. This mRNA appears to be transcribed from an 
endogenous human retrovirus which has been spliced to a non-viral sequence at its 3' 
end. This 3' sequence is unique in the genome. This. mRNA does not appear to encode 
a complete protein. The 3* exon of the mRNA shares similarity with the Rat low density 
lipoprotein receptor-related protein 2 however this exon contains stop codons suggesting 
that it may represent a transcript derived from a pseudogene. 
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The Affymetrix probeset 237275_at exhibited an ES/EB ratio of 39. This 
probeset was derived from the Human Unigene cluster Hs.143925 which is also 
represented by the Genbank clone BC026300. This mRNA is composed of five 
exons and is encoded by a gene located on Chromosome 13. This gene 
appears to encode three different related mRNAs represented by BC026300, 
BC018008 and BC017942 as displayed below, but the 237275_at probeset is 
specific for the 3' exon of BC026300. 




The BC026300 mRNA has three exohs that contain repetitive DNA sequences 
and two exons that contains unique DNA sequences that have been conserved 
across the mouse, rat and human genomes. 



Protein: 

The longest open reading within this mRNA is located in the 3! exon containing 
unique sequences. The amino acid sequence of this open reading frame does is 
not similar to any known protein. Comparison of this peptide to the Pfam 
database http://pfam.wustl.edu/ using hmmsearch reveals no similarity to know 
protein motifs. Analysis of this coding sequence with the methods of Fickett 
(NAR 10(17); 5303-5318 (1982)) and Gribskov et al. (NAR 12(1); 539-549 
(1984)) as implemented in The GCG Wisconsin Package applications Testcode 
and CodonPreference suggest that this sequence may not actually encode 
protein. 
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The Affymetrix probeset 219823_at exhibited an ES/EB ratio of 26. This probeset was 
derived from the Human Unigene cluster Hs.86154 which is also represented by the 
RefSeq done NM_024674. This mRNA encodes a 209 amino acid protein (below) that 
is similar to the c. elegans lin-28 RNA-binding protein. 




Protein Motifs: 

Comparison of the protein derived from this sequence to the Pfam database 
http://pfam.wustl.edu/ using hmmsearch reveals the presence of a Cold-shock DNA- 
binding domain sequence and Zinc knuckle domain sequence as diagramed below. The 
presence of these sequences suggest that this protein is involved in binding RNA or 
single stranded DNA. 



CSD zf-CCHC 

Hydropathy: 

Kyte-Doolittle Hydropathy Plot (window size = 9) 

A hydropathy plot of this protein indicates that is a relatively hydrophilic protein with no 
transmembrane domains (see below). 



Kyte-Doolittle Hydropathy Plot 
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The Affymetrix probeset 229661_at exhibited an ES/EB ratio of 23. This probeset was 
derived from the Human Unigene cluster Hs.189095 which is also represented by the 
RefSeq clone NM_02Q436. This mRNA is encoded by a gene located on Chromosome 
20 that has been genetically linked to the congenital ppthalmoplegia 'Duane radial ray 
syndrome'. This mRNA encodes a 1053 amino acid protein known. as SaII4 that is a 
member of a family of proteins that contain a C2H2 zinc finger domain and are thought 
to be transcription factors. 
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Protein Motifs: 

Comparison of the protein derived from this sequence to the Pfam database 
http://pfam.wustl.edu/ using hmmsearch reveals the presence of seven C2H2 zinc finger 
domains as diagramed: 



Hydropathy: 

Kyte-Doolittle Hydropathy Plot (window size = 9) 

A hydropathy plot of this protein indicates that is a relatively hydrophilic protein with no 
transmembrane domains (see below). 
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